The Performance of the Intel TFLOPS Supercomputer

نویسندگان

  • Greg Henry
  • Timothy G. Mattson
چکیده

The purpose of building a supercomputer is to provide superior performance on real applications. In this paper, we describe the performance of the Intel TFLOPS Supercomputer starting at the lowest level with a detailed investigation of the Pentium® Pro processor and the supporting memory subsystem. We follow this with a description of the benchmarks used to track the performance of the machine over its development life cycle, which culminated in the first MP LINPACK run to exceed a rate of one trillion floating point operations per second (TFLOPS). Our analysis applies not only to the TFLOPS supercomputer, but also to servers and workstations based on the Intel 32-bit architecture. We conclude with a discussion of the machine's performance on a production application.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Overview of the Intel TFLOPS Supercomputer

Computer simulations needed by the U.S. Department of Energy (DOE) greatly exceed the capacity of the world’s most powerful supercomputers. To satisfy this need, the DOE created the Accelerated Strategic Computing Initiative (ASCI). This program accelerates the development of new scalable supercomputers and will lead to a supercomputer early in the next century that can run at a rate of 100 tri...

متن کامل

Scalable Platform Services on the Intel TFLOPS Supercomputer

This paper describes Scalable Platform Services (SPS)—a collection of software providing the manageability solution for Intel’s latest parallel processing supercomputer. Compared to previous generations of supercomputer management environments, such as that of the Intel Paragon Supercomputer, the SPS makes significant strides in feature offerings and overall usability. The SPS consists of dist...

متن کامل

Achieving Large Scale Parallelism Through Operating System Resource Management on the Intel TFLOPS Supercomputer

From the point of view of an operating system, a computer is managed and optimized in terms of the application programming model and the management of system resources. For the TFLOPS system, the problem is to manage and optimize large scale parallelism. This paper looks at the management in terms of three key topics: memory management, communication, and input/output. For memory management, we...

متن کامل

Lattice QCD on Intel R © Xeon Phi TM coprocessors

Lattice QuantumChromodynamics (LQCD) is currently the only known model independent, non perturbative computational method for calculations in the theory of the strong interactions, and is of importance in studies of nuclear and high energy physics. LQCD codes use large fractions of supercomputing cycles worldwide and are often amongst the first to be ported to new high performance computing arc...

متن کامل

Nankai Stars : an example of designing , constructing , evaluating , and applying a 5 - Tflops Beowulf supercomputer

This paper presents the design considerations, evaluation methods and a scoring system developed to create a modern high-performance computing facility for Nankai Institute for Scientific Computing (ISC). The facility consists of Nankai Stars a scalable massively parallel cluster supercomputer and a supporting laboratory. Nankai Stars, design is suitable for, a wide range of applications that e...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998